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ABSTRACT 

The  accepted  definition  of  frequency  stability  in  the  time  domain  is  the  two-sample 
variance  (or  Allan  variance).  It  is  based  on  the  measurement  of  average  frequencies 
over  adjacent  time  intervals,  with  no  “dead  time”  between  the  intervals.  The  primary 
advantages  of  the  Allan  variance  are  that  (l)  it  is  convergent  for  many  actual  noise 
types  for  which  the  conventional  variance  is  divergent;  (2)  it  can  distinguish  between 
many  important  and  different  spectral  noise  types;  (3)  the  two-sample  approach  relates 
to  many  practical  implementations,  for  example,  the  rms  change  of  an  oscillator’s 
frequency  from  one  period  to  the  next;  and  (4)  Allan  variances  can  be  easily  estimated 
at  integer  multiples  of  the  sample  interval. 

In  1974  a  table  of  bias  functions  which  related  variance  estimates  with  various  config¬ 
urations  of  number  of  samples  and  dead  time  to  the  two-sample  (or  Allan)  variance 
was  published^ 1 1 .  The  tables  were  based  on  noises  with  pure  power-law  power  spectral 
densities. 

Often  situations  recur  which  unavoidably  have  distributed  dead  time  between  mea¬ 
surements,  but  still  the  conventional  variances  are  not  convergent.  Some  of  these 
applications  are  outside  of  the  time  and  frequency  field.  Also,  the  dead  times  are 
often  distributed  throughout  a  given  average,  and  this  distributed  dead  time  is  not 
treated  in  the  1974  tables. 

This  paper  reviews  the  bias  functions  B1(N,  r,/i),  and  B2(r,n)  and  introduces  a  new 
bias  function,  53(2,r,/i),  to  handle  the  commonly  occurring  cases  of  the  effect  of  dis¬ 
tributed  dead  time  on  the  computed  variances.  Some  convenient  and  easy  to  interpret 
asymptotic  limits  are  reported.  The  actual  tables  will  be  published  elsewhere  in  book 
form. 


INTRODUCTION 

The  most  common  statistics  used  are  the  sample  mean  and  variance.  These  statistics 
indicate  the  approximate  magnitude  of  a  quantity  and  its  uncertainty.  For  many  situations 
a  continuous  function  of  time  is  sampled,  or  measured,  at  fairly  regular  instants.  The 
sampling  process  (or  measurement)  is  not  normally  instantaneous  but  takes  a  finite  time 
interval  to  give  an  “average  reading”.  If  the  underlying  process  (or  noise)  is  random  and 
uncorrelated  in  time,  then  the  fluctuations  are  said  to  be  “white”  noise.  In  this  situation, 
the  sample  mean  and  variance  calculated  by  the  conventional  formulas, 
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provide  the  needed  information.  The  “bar”  over  the  y  in  (1)  denotes  the  average  over  that 
finite  time  interval.  As  in  science  generally,  the  physical  model  determines  the  appropriate 
mathematical  model.  For  the  white  noise  model  the  sample  mean  and  variance  are  the 
mainstays  of  most  analyses. 

Although  white  noise  is  a  common  model  for  many  physical  processes,  other  noises  are 
becoming  increasingly  common.  In  the  field  of  precise  time  and  frequency  measurement, 
for  example,  there  are  two  quantities  of  great  interest:  instantaneous  frequency  and  phase. 
These  two  quantities  by  definition  are  exactly  related  by  a  differential.  (We  are  NOT 
considering  Fourier  frequencies  at  this  point.)  That  is,  the  instantaneous  frequency  is  the 
time  rate  of  change  of  phase.  Thus,  if  we  were  employing  a  model  of  white  frequency  noise 
(white  FM)  this  is  logically  equivalent  to  phase  noise  which  is  the  integral  of  the  white 
noise —  commonly  called  a  Brownian  motion  or  a  random  walk.  Depending  on  whether  we 
are  currently  interested  in  phase  or  frequency  the  sample  mean  and  variance  may  or  may 
not  be  appropriate. 

By  definition,  white  noise  has  a  power  spectral  density  (PSD)  which  does  not  vary  with 
Fourier  frequency.  Since  random  walk  noise  is  the  integral  of  a  white  noise,  it  can  be 
shown  that  the  power  spectral  density  of  a  random  walk  varies  as  l//2  (where  /  is  a 
Fourier  frequency).  We  encounter  noise  models  whose  power  spectral  densities  are  various 
power-laws  of  their  Fourier  frequencies.  Flicker  noise  is  a  very  common  noise  and  is  defined 
as  a  noise  whose  power  spectral  density  varies  as  1//  over  a  significant  spectral  range.  If 
an  oscillator’s  instantaneous  frequency  is  well  modeled  by  a  flicker  noise,  then  its  phase 
would  be  the  integral  of  the  flicker  noise.  It  would  have  a  PSD  which  varied  as  l//3 . 

Noise  models  whose  PSD’s  are  power  laws  of  the  Fourier  frequency  but  not  integer  expo¬ 
nents  are  possible  but  not  common.  This  paper  considers  power-law  PSD’s  of  a  quantity 
y(i);  y(t)  is  a  continuous  sample  function  which  can  be  measured  at  regular  intervals.  For 
noises  whose  PSD’s  vary  as  /“  with  a  <  -1  at  low  frequencies,  the  conventional  sample 
mean  and  variance  given  in  (1)  diverge  as  N  gets  large!2,3!.  This  divergence  renders  the 
sample  mean  and  variance  nearly  useless  in  some  situations. 

Although  the  sample  mean  and  variance  have  limited  usefulness,  other  time-domain  statis¬ 
tics  can  be  convergent  and  quite  useful.  The  quantities  which  we  consider  in  this  paper 
depend  greatly  on  the  details  of  the  sampling.  Indeed,  each  sampling  scheme  has  its  own 
bias,  and  this  is  the  origin  of  the  bias  functions  discussed  in  this  paper. 


THE  ALLAN  VARIANCE 

As  a  result  of  the  recognition  that  the  conventional  sample  variance  fails  to  converge  as  the 
number  of  samples,  N,  grows,  Allan  suggested  that  one  can  set  N  =  2  and  average  many 
of  these  two-sample  variances  to  get  a  convergent  and  stable  measure  of  the  spread  of  the 
quantity  in  question!3!.  This  is  what  has  come  to  be  called  the  Allan  variance. 

More  specifically  let’s  consider  a  sample  function  of  time  as  indicated  in  Fig.  1.  A  mea¬ 
surement  consists  of  averaging  y(t)  over  the  interval,  r.  The  next  measurement  begins  a 
time  T  after  the  beginning  of  the  previous  measurement  interval.  There  is  no  logical  reason 
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why  T  must  be  as  large  as  r  or  larger  —  if  T  <  r  then  the  second  measurement  begins  before 
the  first  is  completed,  which  is  unusual  but  possible.  When  T  —  t  there  is  no  “dead  time” 
between  measurements. 

The  accepted  definition  of  the  Allan  variance  is  the  expected  value  of  a  two-sample  variance 
with  NO  dead  time  between  successive  measurements.  In  symbols,  the  Allan  variance  is 
given  by: 

*l(r)  =  l/2 J57[(yx  -y2)2]  (2) 

where  there  is  no  dead  time  between  the  two  sample  averages  for  the  Allan  variance  and 
the  E[  ]  is  the  expectation  value. 


THE  BIAS  FUNCTION  (iV,  r,  /*) 

Define  N  to  be  the  number  of  sample  averages  of  y(t)  used  in  (l)  to  estimate  a  sample 
variance  (N  =  2  for  an  Allan  variance).  Also  define  r  to  be  the  ratio  of  r  to  r  (r  =  l  for 
no  dead  time  between  measurements).  The  parameter  //  is  related  to  the  exponent  of  the 
power-law  of  the  PSD  of  the  process  y(t).  It  has  been  shown  that  if  a  is  the  exponent  in 
the  power-law  spectrum  for  y(i) ,  then  the  Allan  variance  varies  as  r  raised  to  the  n  power, 
where  a  and  /t  are  related  as  shown  in  Fig.  2l2-4l.  One  can  use  estimates  of  //  to  infer  a, 
the  spectral  type.  The  ambiguity  in  a  for  /t  =  -2  has  been  resolved  by  using  a  modified 


Often  data  cannot  be  taken  without  dead  time  between  sample  averages,  and  it  might 
be  useful  to  consider  other  than  two-sample  variances.  We  will  define  the  bias  function 
•Si  ( N ,  r,  n)  by  the  ratio: 


B\  (N ,  r,  fi) 


a2(N ,  T,r) 
a2(2,r,T) 


(3) 


where  g2(N,T,t)  is  the  expected  sample  variance  given  in  (l)  above,  based  on  N  mea¬ 
surements  at  intervals  T  and  averaged  over  a  time  r  and  r  =  T/r.  In  words,  Bi(N,T,y,)  is 
the  ratio  of  the  expected  variance  for  N  measurements  to  the  expected  variance  for  two 
samples  (everything  else  held  constant).  Implicitly,  the  variances  on  the  right  in  (3)  do 
depend  on  the  noise  type  (the  value  of  //  or  a)  even  though  not  shown  as  an  independent 
variable.  N,  r  and//,  are  shown  as  a  independent  variables  for  all  of  the  bias  functions  in 
this  paper  because  the  values  of  the  ratio  of  these  variances  explicitly  depends  on  /i  as  will 
be  derived  later  in  the  paper.  Allan  showed  that  if  N  and  r  are  held  constant,  then  the 
a,  n  relationship  shown  in  Fig.  2  is  the  same;  that  is,  one  can  still  infer  the  spectral  type 
from  the  r  dependence  using  the  equation  a  =  -/t  -  1,  -2  <  n  <  2l3h 


THE  BIAS  FUNCTION  J52(r,/t) 

The  bias  function  B2(r,/i)  is  defined  by  the  relation 


B2  (r,  m) 


g2(2,  T,  t)  _  g2(2,  T,t) 

<r2(2,r,r)  g2(t) 


(4) 


In  words,  B2(r,n)  is  the  ratio  of  the  expected  variance  with  dead  time  to  that  without  dead 
time  (with  N  =  2  and  r  the  same  for  both  variances).  A  plot  of  the  B2{r,y)  function  is 
shown  in  Fig.  3.  The  bias  functions  Bi  and  B2  represent  biases  relative  to  N  =  2  rather 
than  infinity;  that  is,  the  ratio  of  the  N  sample  variance  (with  or  without  dead  time)  to  the 
Allan  variance  and  the  ratio  of  the  two-sample  dead-time  variance  to  the  Allan  variance. 
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THE  BIAS  FUNCTION  B3{N,M,r,n ) 

Consider  the  case  where  measurements  are  available  with  dead  time  between  each  pair 
of  measurements.  The  measurements  were  averaged  over  the  time  interval  tq,  and  it  may 
not  be  convenient  to  retake  the  data.  We  might  want  to  estimate  the  Allan  variance  at, 
say,  multiples  of  the  averaging  time  r0.  If  we  average  groups  of  the  measurements  of  y{t), 
then  the  dead  times  between  each  initial  measurement  are  distributed  throughout  the  new 
average  measurement  rather  than  accumulating  at  the  end  (see  Fig.  4).  Define: 


M  +  i-l 

Vi  =  (VM)  »»•  (5) 

n=t 

where  yn  are  the  raw  or  original  measurements  based  on  dead  time  Ta  -  tq. 

Also  define  the  variance  with  distributed  dead  time  as: 

a2  (2  ,MT0,Mt0)  =  imfa-Vi+u)%  (6) 


where  r  —  Mr0  and  T  =  MTa. 

We  can  now  define  B3  as  the  ratio  of  the  TV-sample  variance  with  distributed  dead  time  to 
the  TV-sample  variance  with  dead  time  accumulated  at  the  end  as  in  Fig.  1: 


•B3  (TV,  M,r,n) 


ar2{N,  MT0,  Mt0) 
a2(TV,T,r) 


(7) 


Although  B3(N,  M,  r,  fi)  is  defined  for  general  TV,  the  tables  here  confine  treatment  to  the 
case  where  TV  =  2.  There  is  little  value  in  extending  the  tables  to  include  general  TV,  and 
the  work  becomes  excessive.  Though  the  variance  on  the  right  in  (7)  depend  explicitly 
on  TV,  T  snf  r,  the  ratio  B3(N,M,R,/j.)  can  be  shown  to  depend  on  the  ratio  r  =  T/r,  and 
explicitly  on  n  as  developed  later  in  this  paper. 


In  words,  B3(2,M,r,fi)  is  the  ratio  of  the  expected  two-sample  variance  with  distributed 
dead  time,  as  shown  in  Fig.  4,  to  the  expected  two-sample  variance  with  all  the  dead 
time  grouped  together  as  shown  in  Fig.  1.  Both  the  numerator  and  the  denominator  have 
the  same  total  averaging  time  and  dead  time,  but  they  are  distributed  differently.  The 
product  B2(r,n)  ■  B3{2,M,r,n)  is  the  distributed  dead-time  variance  over  the  Allan  variance 
for  a  particular  T,  r,  M  and  fi. 


Some  useful  asymptotic  forms  of  B3  can  be  found.  In  the  case  of  large  M  and  M  >  r,  we 
may  write  that: 

B3  —  ,  1<M<2, 


(8) 


B  -■  4ln(2) 

3  21n(r)  +  3’ 


/i  =  0. 


One  simple  and  important  conclusion  from  these  two  equations  is  that  for  the  cases  of 
flicker  noise  FM  and  random  walk  noise  FM  the  dependence  is  the  same  whether  or  not 
there  is  distributed  dead  time  for  the  asymptotic  limit.  The  values  of  the  variances  only 
differ  by  a  constant.  This  conclusion  is  also  true  for  white  noise  FM,  and  in  this  case  the 
constant  is  1. 


In  the  case  r  >>  1,  and  -2  <  n  <  -1,  we  may  write  for  the  asymptotic  behavior  of  B3: 


B3  Ma,  a  =  — #x  —  1. 


(9) 


230 


In  this  region  of  power-law  spectrum  the  Bz  function  has  an  Ma  dependence  for  an  fa 
spectrum. 

THE  BIAS  FUNCTIONS 

The  bias  functions  can  be  written  fairly  simply  by  first  defining  the  function: 

F{A)  =  2A'J+2  -  (A  +  l)"+2  -  |(A  -  1)|M+2  (10) 

The  bias  functions  become: 


Bi(N,  r,  n) 


l  +  (l/2)F(r) 


(11) 


B3(2,M,r<  pi) 


B2(r,  m) 


1  +  (1/2)  F(r) 
2(1  —  2^)  ’ 


(12) 


2 M"+2  +  F(MR)  ■  M  -  ~  «)[2F(nr)  -  (F(M  +  n)r)  -  F(M  -  n)r)] 

(M"+2)[F(r)  +2]  '  1  ’ 


For  /  =  0,  Equations  (11),  (12),  and  (13)  are  indeterminate  of  form  0/0  and  must  be 
evaluated  by  L’Hospital’s  rule.  Special  attention  must  also  be  given  when  expressions  of 
the  form  0°  arise.  We  verified  a  random  sampling  of  the  table  entries  using  noise  simulation 
and  Monte  Carlo  techniques.  No  errors  were  detected.  The  results  in  this  paper  differ 
somewhat  from  those  in  Ref.  3. 


CONCLUSION 

For  some  important  power-law  spectral  density  models  often  used  in  characterizing  preci¬ 
sion  oscillators  ( Sy{f )  ~  a  =  -2, -1,0, +1, +2),  we  have  studied  the  effects  on  variances 
when  dead  time  exists  between  the  frequency  samples  and  the  frequency  samples  are  aver¬ 
aged  to  increase  the  integration  time.  Since  dead  time  between  measurements  is  a  common 
problem  throughout  metrology,  the  analysis  here  has  broader  applicability  than  just  time 
and  frequency  regarding  traditional  variance  analysis. 

Heretofore  the  two-sample  zero  dead-time  Allan  variance  has  been  shown  to  have  some 
convenient  theoretical  properties  in  relation  to  power-law  spectra  as  the  integration  or 
sample  time  is  varied  (if  rM,  then  a  =  —pi  —  1,  pi  >  -2).  Since  cry(r),  by  definition,  is 

estimated  from  data  with  no  dead-time,  the  sample  or  integration  time  can  be  unambigu¬ 
ously  changed  to  investigate  the  r  dependence.  In  this  paper  we  have  shown  that  in  the 
asymptotic  limit  of  several  samples  being  averaged  with  dead  time  present  in  the  data, 
the  t  dependence  is  the  same  and  the  a  =  -pi  -  1  relationship  still  remains  valid  for  white 
noise  frequency  modulation  (FM)  [pi  =  -1,  a  =  0),  flicker  noise  FM  (pi  =  0,  a  =  -1),  and  for 
random  walk  FM  (pi  =  +1,  a  =  -2).  The  asymptotic  limit  is  approached  as  the  number  of 
samples  averaged  times  the  initial  data  sample  time  rQ  becomes  larger  than  the  dead  time 
(M  >  r).  The  variances  so  obtained  only  differ  by  a  constant,  which  can  be  calculated  as 
given  in  the  paper. 

A  knowledge  of  the  appropriate  power-law  spectral  model  (which  can,  in  principle,  be 
estimated  from  the  data)  is  required  to  translate  a  distributed  dead-time  variance  to  the 
corresponding  value  of  the  Allan  variance. 
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Table  of  Some  Bias  Function  Identities 


*1(2,4,/!) 
Bi{N,r,  2) 
B^N,  1,1) 
BAN,  1,M) 


Bi{N,r,  — 1) 
Bi(N,r,  —2) 
B2(0,/i) 

b2(i,m) 

B2(r,2) 

*2(r,l) 

B2(r,-1) 

52(r,-2) 


B3(2,M,  1,/x) 
B3{2,M,r,-2) 
B3(2,r,n) 
B3{2,M,r,2) 
B3(2,  M,  r,  — l) 


=  1.000 

=  (JV(fV  +  i))/6 

=  N/2 

=  {N(  1  -  iV'i))/[(2(iV  -  1)(1  -  2^)]  for  /x  ^  0 
=  fV  ln(Ar)/[2(iV  —  1)  ln(2)]  for  /x  =  0 
=  1.000  for  /x  <  0 
JV-l 

=  [2/JV(lV  —  1)]  (JV  —  n)  •  nM  for  n  >  0 

n—  1 

=  1.000  if  r  >  1 
=  1.000  if  r  —  1  or  0 
=  0 

=  1.000 
=  r2 

=  (3r  —  l)/2  if  r  >  1 
=  rif0<r<l 
=  1.000  if  r  >  1 
=  0  if  r  =  0 
=  1  if  r  =  1 
=  2/3  otherwise 
=  1.000 
=  M 
=  1.000 
=  1.000 

=  1.000  for  r  >  1 
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QUESTIONS  AND  ANSWERS 


James  Barnes,  Austron:  I  have  a  brief  comment  on  the  history  of  this.  The  bias  functions, 
Bi  and  J52,  were  published  some  22  years  ago.  We  considered  53  at  that  time,  but  decided 
that  there  was  no  use  for  it.  We  didn’t  want  to  encourage  people  to  do  that.  However, 
there  are  people  outside  the  time  and  frequency  community  who  are  acquiring  data  in  this 
lousy  form.  They  need  to  be  able  analyze  this  data. 

Mr.  Allan:  That’s  a  very  good  point.  The  biggest  need  for  it  is  outside  the  time  and 
frequency  field. 
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